# Screen Content Reasoning
Ferret UI Llama8b
Ferret-UI is the first multimodal large language model (MLLM) focused on user interfaces. This variant is built on Llama-3-8B and performs complex UI tasks such as referring, grounding, and reasoning.
Image-to-Text
Transformers

jadechoghari
256
69
Ferret UI Gemma2b
Ferret-UI is the first multimodal large language model (MLLM) focused on user interfaces. This variant is built on Gemma-2B and is designed for UI referring, grounding, and reasoning tasks.
Image-to-Text
Transformers

jadechoghari
302
50